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Abstract 

A simple model of random Brownian walk of a spherical mesoscopic particle in viscous liquids is proposed. 
The model can be both solved analytically and simulated numerically. The analytic solution gives the known 
Einstein- Smoluchowski diffusion law (r^) — Dt where the diffusion constant D is expressed by the mass and 
geometry of a particle, the viscosity of a liquid and the average effective time between consecutive collisions 
of the tracked particle with liquid molecules. The latter allows to make a simulation of the Perrin experiment 
and verify in detailed study the influence of the statistics on the expected theoretical results. To avoid the 
problem of small statistics causing departures from the diffusion law we introduce in the second part of 
the paper the idea of so called Artificially Increased Statistics (AIS) and prove that within this method of 
experimental data analysis one can confirm the diffusion law and get a good prediction for the diffusion 
constant even if trajectories of just few particles immersed in a liquid are considered. 

1 Introduction 

Recently a big progress has been made in application of digital technique in experimental physics what allows to 
perform milestone physics experiments even in student laboratories. A good example is the Perrin experiment [1] 
considered as the first one directly proving the atomic structure of matter. However, its verification at university 
laboratories [2], [3], [4], [5] due to small statistics one takes, may meet some difficulties (see e.g. [2], [4], [5]). The 
linear dependence between the average square displacement (r^) of the particle in media due to its Brownian 
motion and the observation time t as required by the Einstein-Smoluchowski diffusion law often becomes very 
problematic. 

It is essential therefore to examine the minimal statistics (number of tracked particles) one should take into 
account in the limited observation time to reveal the major feature of the diffusion law. We propose the 
analytical model which can be also easy simulated numerically. The aim of this model is to investigate how the 
results of (r^) versus t depend on the statistics and what the scaling range of the expected linear relationship 
is. This study should help to set up the experiment properly as well as to analyze the obtained results more 
correctly. 

In the next section we present the model directly reflecting the physics standing behind the Perrin experiment. 
In section 3 the results of numerical simulation of this model are described and main features of diffusion relation 
with its scaling range are revealed for various number of particles to be observed. To avoid the problem of small 
statistics causing departures from the strict power law behavior we introduce and discuss the idea of Artificially 
Increased Statistics(AIS) in section 4. This method is then applied both to the results of numerical simulation 
and to some experimental data. We argue the method may significantly decrease the level of statistical noise 
in data, leading to much better agreement with the linear dependence in diffusion theory. In the last section 
summary of obtained results is given. 



2 Description of the Model 



The most popular derivation of the diffusion law in media with viscosity comes from Einstein [6] , Langevin [7] 
and Smoluchowski [8]. Here we propose another approach based on the time series analysis combined with the 
average time r between consecutive collisions of the tracked mesoscopic particles with other particles in liquid 
(i.e. r has the meaning of the average time between collisions which significantly change the motion of the 
tracked object). Such approach seems to be closer to the spirit of the original Perrin experiment [1]. 
Let the trajectory of the observed particle of mass m moving in d-dimcnsional space is x"(t), where a = 1, 2, d. 
We assume x°'{t) to be discrete d-dimensional time series with constant spacing r in time {t = 0, r, 2t, Nt). 
The obvious notation 

a;«(fcT)=a;^,fc = l,2,...,iV (1) 

and 

Axt=xt+^-xt (2) 

will be applied, where Ax^ is the instantenous displacement of the particle a,t t = kr. 

For the stationary, integer Brownian motion (no displacement correlation) with no drift one has for large n: 

(Axr)n = (3) 

and 

(Ax?Axp„ = %«)^ (4) 

where (.)n is the average taken over the ensamble of n tracked particles and (cr")^ = is the standard deviation 
of instantenous displacements, i.e.: 

{{Axf)X^'r' (5) 

The total mean squared displacement (r^)„ of particles from their initial positions after A'' collisions can be 
easy calculated with the help of Eq. (5): 

d N ,2 
a=l i 

In order to calculate cr^ let us notice that 

Axr=r{vf)r (7) 
with {v")t being the average velocity of the i - th particle between collisions. Hence from Eqs. (5) and(7): 

ct'=t\{v^)% (8) 

The equipartition theorem establishes the connection of microscopic quantities with the absolute temperature 
T and the Boltzmann constant k: 

lm{{vr)l)„ = \kT (9) 



Therefore Eq. (6) reads: 



(Ar2)„ = (^r)t (10) 
m 



The above formula is the standard diffusion law with the diffusion constant 



D = r (11) 

m 



expressed in terms of r. 

Usually one writes D in terms of liquid viscosity r] as 



(12) 



where a = dngr] (Stokes law) and g being the radius of the considered mesoscopic particles. 

Hence one gets the simple relation between parameter r in the model and macroscopic quantities m, a: 

r=^ (13) 

a 

Thus the model reproducing the known diffusion law also estimates the average time r lapsing between 
consecutive collisions in the system as the simple function of macroscopically measured quantities. This time 
can be taken as the input parameter in the numerical study of the Perrin experiment what is done in the next 
section. 



3 Numerical Simulation of the Perrin Experiment 

The solution in Eq. (10) can be checked via numerical simulation of the Brownian motion in viscous media. In 
fact this simulation is the only way one can find the sufficient statistics, i.e. the number of tracked particles in 

the ensamble one should observe in real experiment to obtain results confirming the linear relation. If sufficient 
statistics requirement is not satisfied, one observes significant departures from the linear behavior (r^)„ ~ t (see 
e.g. Ref. [2], [4]). 

We simulated all time series {xf} in d = 2 dimensions usually discussed by experimentalists. The time series 
were built in the well known iterative way 

xf+,=xf + Axf (14) 

= [xlf + (x'S (15) 
where displacements have been generated as the random gaussian numbers N{0^ a), with the standard deviation 

1 /2 

a = T{kT/m) ' obtained from Eqs. (8), (9). All simulations were performed for the case of diffusion in pure 
water (77 = 1.00 x lO"'' Pa ■ s), room temperature T = 295 K, m = 4.28 x 10"-^^ kg and g = 425 nm what 
roughly corresponds to the real Perrin experiment parameters. 

The essential task to be done just in the beginning was to determine the scaling range A of the discussed 
linear dependence as a function of the number of tracked particles n. It was done for the bunch of simulated 
trajectories varying the number of observed particles in the range n = 10-^500. The bunch of twenty trajectories 
was investigated for any n in the above range. The examples of just five runs in each bunch (for the clarity of 
figure we do not show all the runs) are pictured in Fig.l a-d. Hence we have found the scaling range relation 
revealed in Fig. 2. The best fit gives 

A ~ n'^ (16) 
where /3 = 0.51 ± 0.04 and the uncertainty comes from the statistics. 

Let us notice that if the number of observed particles docs not exceed 10 the linear dependence ^ t can 

be confirmed only for the observation time t < 3s! It makes the analysis taking into account longer observation 
times (as authors of Ref. [4] did), simply incorrect. 

Having the scaling range determined we may proceed to calculate the diffusion constant value and its expected 
standard deviation from the mean. Such analysis was done by us for the simulated trajectories mentioned above. 
Some chosen cases (again for the clarity of graph we do not show all of them) with maximal and minimal values 
of D for every n are shown in Fig. 3 a-d. All results of the mean D values and their standard deviation as the 
function of n are presented in Fig. 4. 

Hence we see that the final result within 10% percent of the expected theoretical value can be found only if 
one considers the ensamble of n > 50 particles. 



4 Analysis of Results with Artificially Increased Statistics 

The results of the previous section seem to suggest that to get a reasonable agreement with the diffusion law 
predictions one should take into account in the real experiment data from at least n ~ 50 particles. In many less 
professional labs (e.g. student labs) such a requirement is virtually impossible to be satisfied - mainly because 



of the limited time duration of the data collection if no sophisticated computerized apparatus is used. Below 
we give the idea that helps to overcome such a difficulty. We call it Artificially Increased Statistics (AIS). 
The main idea of AIS is to build the statistics of consecutive displacements from the very small number of 
available trajectories, counting all the displacements not from the initial starting point (xJjXq) — (0,0) but 
varying it along the whole one particle trajectory. Thus any momentary position of the particle, say (x^, x'^),k = 
1,2,...,N is the starting point to collect statistics of all displacements afterwards, i.e. (Aa;;^_^, Aa;^_^), Z > k. 
where Aa;"_j, = xf — x"^ is the a-th part of the {I — k) step displacement. This way for the time series of Icnigth 
TV we have N — m data for m-steps displacements instead of just one displacement usually taken into account. 
Then the statistics is averaged in the usual way over the all considered (observed) particles. This way even if 
n is small the overall number of data entering the statistics is large enough to fulfil the linear law expectation. 

Let us now look at the results of the; application of AIS to the simulated Brownian motion as well as to the 
pure experimental data from the real experiments. 

In Fig.5a-b we present the bunch of squared displacements in time taken for the statistics of n = 10(a) and 
n = 5G(b) particles worked out with the AIS procedure. The comparison with the "naked" data from Fig.3a-b 
shows the tremendous difference. Although the scaling range after AIS lifting does not seem to change a lot, 
the linear dependence < >~ f is now much more convincing. In fact the comparison of diffusion constants 
D obtained from the " naked" analysis and from the data lifted by AIS shows about 7 times smaller uncertainty 
in D evaluation in the case of n = 10 statistics (see Table 1). The corresponding result for the n = 50 case is 

Table 1: The comparison analysis of diffusion constant values found as the best fit before and after the AIS 
procedure. Each item is taken from different simulation of the n = 10 or n = 50 particles run. The scaling 
range is fixed according to Fig; la.}) with the sampling; time interval At = lOV O.Ol.s. 





D{fim^ s ^) 


n=10 particles 


n=50 particles 


Run nr 


Before AIS 
("naked" data) 


AIS data 


Before AIS 
("naked" data) 


AIS data 


1. 


1.04 


1.83 


1.92 


1.92 


2. 


2.07 


1.76 


1.90 


1.95 


3. 


1.28 


1.90 


1.88 


1.95 


4. 


1.65 


2.15 


2.29 


1.83 


5. 


3.23 


2.00 


2.24 


2.16 


6. 


0.96 


1.92 


2.00 


2.03 


7. 


2.73 


1.95 


1.73 


2.05 


8. 


4.01 


2.23 


1.74 


1.86 


9. 


1.70 


1.82 


2.64 


2.00 


10. 


1.46 


1.99 


1.99 


1.89 


<D> 


2.00 


1.96 


2.03 


1.96 




1.0 


0.15 


0.28 


0.10 



improved about 3 times. 

We have calculated also the mean absolute error (MAE)defined as 

1 ^ 

6MAE = J,J2\^k-^th\ (17) 
fe=l 

where Dth is the theoretical value of the diffusion coeflacient {Dth = 2.01/im^/s for the considered diffusion 
process) and the sum is taken over all simulated runs. 

For the sample of 10 runs with and without AIS one obtains for n = 10 particles 6MAE{n = 10) = 0.80 iJ.m'^s~^ 
decreasing to 5fJ^^{n = 10) = 0.13 /im^.s"-'^ when AIS is switched on. The corresponding result for n = 50 
particles are (in funds' "^) SMAE{n = 50) = 0.20 and S^%{n = 50) = 0.09 respectively. 

The positive feature of AIS procedure can also be seen directly with the pure experimental data. We show in 
Fig. 6a the data taken in Ref. [4] for the diffusion of n = 5 latex spherical particles in the pure water. One gets 



much better correspondence with the Unear dependence when AIS procedure is appUed to these experimental 
points, what is clearly revealed in Fig. 6b. The obtained best fit for the diffusion constant corresponds now 
closely to the expected theoretical value Dth = 2.01 ^m?s~^ what is not the case of the fit obtained by authors 
of Ref. [4]. 

5 Conclusions 

The proper determination of the scaling range for the linear dependence (r^) ~ t is crucial in the data analysis. 
We argued this scaling range behaves like \ ^ , where the constant (3 was determined as (i ~ 0.5. The 
numerical simulation shows that for the case of mesoscopic particles diffusing in water the scaling range for 
n 10 particles is as short as A < 3s. For n < 10 this scaling range is difficult to determine at all. In many 
papers this fact is ignored what gives misleading results. 

However, even if one remains in the scaling range regime the results of simulated runs are not always statistically 
repeatable if too small statistics is considered. The minimal number of tracked particles to reveal the diffusion 
law is n > 50. One may nevertheless find the reasonable correspondence between theoretical predictions and 
experimental results even for the smaller number of tracked particles if the idea of AIS is applied. In this paper 
we have described this idea and have shown how it works for simulated data as well as for data taken from the 
real experiments. It turns out that with AIS analysis one may get results within 10% of the expected theoretical 
value of the diffusion constant tracking just few mesoscopic objects. The corresponding input data without AIS 
gives much bigger uncertainty of the order of 50% (see Fig. 4). The same applies when MAE is calculated. To 
decrease the uncertainty to the former level of 10% one has to track roughly ten times more objects! 
We have checked that for n = 50 tracked particles AIS procedure decreases the statistical uncertainty in D 
from about 15% (the "naked" data case) to ^ 5%. Simultaneously the 6m ae drops down about twice (from 
0.20 iJiw?s~^ to 0.09 iJL'm?s~^). The AIS procedure is here less impressive than for n = 10 case but still shows 
the significant improvement in data results. 

This way it is quite possible to collect data giving very good prediction for the diffusion constant even in less 
professional labs where one is not able to measure simultaneously signals coming from bigger number of objects. 
Hence, other important physical constants (like e.g. Boltzmann constant k or the Avogadro number Na) can 
be deduced with high accuracy what is often the crucial point of such experiments. 

The simulations were done by us also for liquids with other viscosities. The same final conclusions as for the case 
of water can be formulated. Because of very similar results we did not show them explicitly in this paper but 
we believe they should be studied in the way of numerical simulation in any case before the actual experiment 
is planned. 
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Figure 1: Examples of (r^) dependence versus t for the simulated runs of n = 10 (a), n = 50 (b), n — 160 (c) 
and n = 320 (d) particles. The scaling range of the linear dependence (r^) ~ Ms marked as the vertical line in 
each case. 
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Figure 2: The scaling range dependence on the number of tracked particles n. The error bars correspond to 
statistical uncertainties in determination of A for given n. All data come from the numerical simulations. 





Figure 3: The range of diffusion constants obtained for n = 10(a), n = 50 (b), n = 160(c) and n = 320(d) 
tracked particles in numerical simulation. 




Figure 5: The bunch of (r^) ~ t for n = 10(a) and n = 50(b) particles worked out with AIS. The enlarged 
scaling range regime is shown separately. 




Figure 6: The result of AIS procedure applied to the real experimental data of n = 5 latex sphcaical particles 
diffusing in pure water (data taken from Ref. [4]). Fig. 6a represents the "naked" results, while Fig. 6b shows 
the results lifted by AIS. 



